Vector-based Representation and Clustering of Audio Using Onomatopoeia Words
نویسندگان
چکیده
We present results on organization of audio data based on their descriptions using onomatopoeia words. Onomatopoeia words are imitative of sounds that directly describe and represent different types of sound sources through their perceived properties. For instance, the word pop aptly describes the sound of opening a champagne bottle. We first establish this type of audio-to-word relationship by manually tagging a variety of audio clips from a sound effects library with onomatopoeia words. Using principal component analysis (PCA) and a newly proposed distance metric for word-level clustering, we cluster the audio data representing the clips. Due to the distance metric and the audio-to-word relationship, the resulting clusters of clips have similar acoustic properties. We found that as language level units, the onomatopoeic descriptions are able to represent perceived properties of audio signals. We believe that this form of description can be useful in relating higher-level descriptions of events in a scene by providing an intermediate perceptual understanding of the acoustic event.
منابع مشابه
A Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...
متن کاملAn Analysis on Visual Recognizability of Onomatopoeia Using Web Images and DCNN features
In this paper, we examine the relation between onomatopoeia and images using a large number of images over the Web. The objective of this paper is to examine if the images corresponding to Japanese onomatopoeia words which express the feeling of visual appearance can be recognized by the state-of-theart visual recognition methods. In our work, first, we collect the images corresponding to onoma...
متن کاملNamed Entity Recognition in Persian Text using Deep Learning
Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...
متن کاملA Sound Symbolic Study of Translation of Onomatopoeia in Children's Literature: The Case of '' Tintin''
As onomatopoeic words or expressions are attractive, the users of languages in the fields of religion, literature, music, education, linguistics, trade, and so forth wish to utilize them in their utterances. They are more effective and imaginative than the simple words. Onomatopoeic words or expressions attach us to the real nature and to our inner senses. This study aims at familiarity with on...
متن کاملDistributional Clustering of Words for Text Categorization Research Thesis Submitted in Partial Fulfillment of the Requirements for the Degree of Master of Science in Computer Science
We study an approach to text categorization that combines distributional clustering of words and a Support Vector Machine (SVM) classifier. The word-cluster representation is computed using the recently introduced Information Bottleneck method, which generates a compact and efficient representation of documents. When combined with the classification power of the SVM, this method yields high per...
متن کامل